arXiv:1507.00748vl [cs.DS] 2Jul2015 


Approximate Deadline-Scheduling with Precedence Constraints 


Hossein Efsandiari* MohammadTaghi Hajiaghyi* Jochen Konemann^ 
Hamid Mahini* David Malec* Laura Sanita 

July 6, 2015 


Abstract 

We consider the classic problem of scheduling a set of n jobs non-preemptively on a single 
machine. Each job j has non-negative processing time, weight, and deadline, and a feasible 
schedule needs to be consistent with chain-like precedence constraints. The goal is to compute 
a feasible schedule that minimizes the sum of penalties of late jobs. Lenstra and Rinnoy Kan 
[Annals of Disc. Math., 1977] in their seminal work introduced this problem and showed 
that it is strongly NP-hard, even when all processing times and weights are 1. We study the 
approximability of the problem and our main result is an 0(log fc)-approximation algorithm 
for instances with k distinct job deadlines. 

We also point out a surprising connection to a model for technology diffusion processes in 
networks that was recently proposed by Goldberg and Liu [SODA, 2013]. In an instance of 
such a problem one is given an undirected graph and a non-negative, integer threshold 9{v) 
for each of its vertices v. Vertices v in the graph are either active or inactive, and an inactive 
vertex v activates whenever it lies in component of size at least 9{v) in the graph induced by 
itself and all active vertices. The goal is now to find a smallest cardinality seed set of active 
vertices that leads to the activation of the entire graph. 

Goldberg and Liu showed that this problem has no o(log(n))-approximation algorithms 
unless NP has quasi-polynomial time algorithms, and the authors presented an 0(rk log(n))- 
approximation algorithm, where r is the radius of the given network, and k is the number of 
distinct vertex thresholds. The open question is whether the dependence of the approximation 
guarantee on r and k is avoidable. We answer this question affirmatively for instances where the 
underlying graph is a spider. In such instances technology diffusion and precedence constrained 
scheduling problem with unit processing times and weights are equivalent problems. 


1 Introduction 

In an instance of the classic precedence-constrained single-machine deadline scheduling problem 
we are given a set [n] n} of jobs that need to be scheduled non-preemptively on a single 

machine. Each job j € [n] has a non-negative deadline dj G N, a processing time pj G N as 
well as a non-negative penalty wj G N. A feasible schedule has to be consistent with precedence 
constraints that are given implicitly by a directed acyclic graph G = {[n],E)-, i.e., job i G [re] has 
to be processed before job j if G has a directed i, J-path. A feasible schedule incurs a penalty of 
Wj if job j is not completed before its deadline dj. Our goal is then to find a feasible schedule that 
minimizes the total penalty of late jobs. In the standard scheduling notation m the problem 
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under consideration is succinctly encoded as l|prec| '^WjUj, where Uj is a binary variable that 
takes value 1 if job j is late and 0 otherwise. 

Single-machine scheduling with deadline constraints is a practically important and well-studied 
subfield of scheduling theory that we cannot adequately survey here. We refer the reader to Chap¬ 
ter 3 of [23] or Chapter 4 of Uj , and focus here on the literature that directly relates to our prob¬ 
lem. The decision version of the single-machine deadline scheduling problem without precedence 
constraints is part of Karp’s list of 21 NP-complete problems and a fully-polynomial-time 
approximation scheme is known BM- The problem becomes strongly NP-complete in the pres¬ 
ence of release dates as was shown by Lenstra et al. [2T]. Lenstra and Rinnoy Kan [22| later 
proved that the above problem is strongly NP-hard even in the special case where each job has 
unit processing time and penalty, and the precedence digraph G is a collection of vertex-disjoint 
directed paths. 

Despite being classical, and well-motivated, little is known about the approximability of 
precedence-constrained deadline scheduling. This surprises, given that problems in this class were 
introduced in the late 70s, and early 80s, and that these are rather natural variants of Karp’s 
original 21 NP-hard problems. The sparsity of results to date suggests that the combination 
of precedence constraints and deadlines poses significant challenges. We seek to show, however, 
that these challenges can be overcome to achieve non-trivial approximations for these important 
scheduling problems. In this paper we focus on the generalization of the problem studied in |22j . 
where jobs are allowed to have arbitrary non-negative processing times, and where we minimize 
the weighted sum of late jobs. Once more using scheduling notation, this problem is given by 
11 chains I (and hereafter referred to as pDLS). Our main result is the following. 

Theorem 1.1. pDLS has an efficient O(Jog k)-approximation algorithm, where k is the number 
of distinct job deadlines in the given instance. 

We note that our algorithm finds a feasible schedule without late jobs if such a schedule exists. 

In order to prove this result, we first introduce a novel, and rather subtle configuration-type 
LP. The LP treats each of the directed paths in the given precedence system independently. 
For each path, the LP has a variable for all nested collections of k suffixes of jobs, and integral 
solutions set exactly one of these variables per path to 1. This determines which subset of jobs 
are executed after each of the k distinct job deadlines. The LP then has constraints that limit 
the total processing time of jobs executed before each of the k deadlines. While we can show 
that integral feasible solutions to our formulation naturally correspond to feasible schedules, the 
formulation’s integrality gap is large (see see AppendixjDjfor details. In order to reduce the gap, 
we strengthen the formulation using valid inequalities of Knapsack cover-type [I1[51II1|26] (see 
also 

The resulting formulation has an exponential number of variables and constraints, and it is 
not clear whether it can be solved efficiently. In the case of chain-like precedences, we are able to 
provide an alternate formulation that, instead of variables for nested collections of suffixes of jobs, 
has variables for job-suffixes only. Thereby, we reduce the number of variables to a polynomial 
of the input size, while increasing the number of constraints slightly. We do not know how to 
efficiently solve even this alternate LP. However, we are able to provide a relaxed separation oracle 
(in the sense of |5|) for its constraints, and can therefore use the Ellipsoid method m to obtain 
approximate solutions for the alternate LP of sufficient quality. 

We are able to provide an efficiently computable map between solutions for the alternate LP, 
and those of the original exponential-sized formulation. Crucially, we are able to show that the 
latter solutions are structurally nice; i.e., no two nested families of job suffixes in its support cross! 
Such cross-free solutions to the original LP can then be rounded into high-quality schedules. 

Several comments are in order. First, there is a significant body of research that investigates 
LP-based techniques for single-machine, precedence-constrained, minimum weighted completion- 
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time problems (e.g., see Ham [25], and also [7] for a more comprehensive snmmary of LP-based 
algorithms for this problem). None of these LPs seem to be useful for the objective of minimizing 
the total penalty of late jobs. In particular, converting these LPs requires the introduction of so 
called “big-M”-constraints that invariably yield formulations with large integrality gaps. 

Second, using Knapsack-cover inequalities to strengthen an LP formulation for a given covering 
problem is not new. In the context of approximation algorithms, such inequalities were used by 
Carr et al. [3] in their work on the Knapsack problem and several generalizations. Subsequently, 
they also found application in the development of approximation algorithms for general eovering 
and paeking integer programs m, in approximating eolumn-restricted covering IPs mu , as 
well as in the area of scheduling (without precedence constraints) [5|. Note that our strong 
formulations for pDLS use variables for (families of) suffixes of jobs in order to encode the chain¬ 
like dependencies between jobs. This leads to formulations that are not column-restricted, and 
they also do not fall into the framework of m (as, e.g., their dimension is not polynomial in the 
input size). 

Third, it is not clear how what little work there has been on precedence-constrained deadline 
scheduling can be applied to the problem we study. The only directly relevant positive result 
we know of is that of Ibarra and Kim US], who consider the single-machine scheduling problem 
in which n jobs need to be scheduled non-preemptively on a single machine while adhering to 
precedence constraints given by aeyelic directed forests, with the goal to maximize the total profit 
of jobs completed before a common deadline T. While the allowed constraints are strictly more 
general than the chain-like ones we study, this is more than outweighed by the fact that all jobs 
have a common deadline, which significantly reduces the complexity of the problem and renders 
it similar to the well-studied Knapsack problem. Indeed, we show in Appendix |B] that pDLS 
with forest precedences and a single deadline admits a pseudo-polynomial time algorithm as well. 
This implies that the decision version of pDLS is only weakly NP-complete in this special case. 
Given the strong NP-hardness of pDLS (as established in [22] 1. it is unclear how Ibarra and Kim’s 
results can be leveraged for our problem. 

It is natural to ask whether the approximation bound provided in Theorem 11.11 can be im¬ 
proved. In Appendix O we provide an example demonstrating that this is unlikely if we use 
a path-independent rounding scheme (as in the proof of Theorem [LI]). This example highlights 
that different paths can play vastly different roles in a solution, and be critical to ensuring that 
distinct necessary conditions are met. Thus, rounding paths independently can lead to many 
independent potential points of failure in the process, and significant boosting of success proba¬ 
bilities must occur if we are to avoid all failures simultaneously. This means, roughly speaking, 
that our analysis is tight and therefore our approximation factor cannot be improved without 
signihcant new techniques. Given the above, it is natural to look for dependent rounding schemes 
for solutions to our LP. Indeed, such an idea can be made to work for the special case of pDLS 
with two paths. 

Theorem 1.2. pDLS with two paths admits a 2-approximation algorithm based on a correlated 
rounding scheme. 

The proof of Theorem 11.21 is given in Appendix [C] and shows that the configurational LP 
used in the proof of Theorem 11.11 has an integrality gap of at most 2 for pDLS instances with two 
paths. This is accomplished using a randomized rounding scheme that samples families of suffix 
chains from the two paths in a correlated fashion instead of independently. The approach uses 
the fact that our instances have two paths, and extending it to general instances appears difficult. 

We point out that the emphasis in Theorem 11.21 and its proof is on the techniques used rather 
than the approximation guarantee obtained. In fact, we provide a dynamic-programming-based 
exact algorithm for pDLS instances with a fixed number of chains (see Appendix [^ for details). 
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Theorem 1.3. pDLS can be solved exactly when the number of chains is fixed. 

1.1 Deadline scheduling and technology diffusion 

As we show now, the precedence-constrained single-machine deadline scheduling problem is closely 
related to the technology diffusion (TD) problem which was recently introduced by Goldberg and 
Liu [9] in an effort to model dynamic processes arising in technology adaptation scenarios. In 
an instance of TD, we are given a graph G = {V, E), and thresholds 9{v) G {6i, ..., O^} for each 

V G V. We consider dynamic processes in which each vertex t; G D is either active or inactive, 
and where an inactive vertex v becomes active if, in the graph induced by it and the active 
vertices, v lies in a connected component of size at least 9{v). The goal in TD is now to find a 
smallest seed set S of initially active vertices that eventually lead to the activation of the entire 
graph. Goldberg and Liu argued that it suffices (albeit at the expense of a constant factor loss in 
the approximation ratio) to consider the following connected abstraction of the problem: find a 
permutation tt = {vi, ... ,Vn) of G such that the graph induced hy vi, ... ,Vi is connected, for all 
i, and such that 

S{Tr) = {vi : i < 9{vi)} 

is as small as possible. 

As Goldberg and Liu [9] argue, TD has no o(log(n))-approximation algorithm unless NP has 
quasi-polynomial-time algorithms. The authors also presented an 0(rA; log(n))-approximation, 
where r is the diameter of the given graph, and k is the number of distinct thresholds used 
in the instance. Konemann, Sadeghian, and Sanita m recently improved upon this result by 
presenting a 0(min{r, A;} log(n))-approximation algorithm. The immediate open question arising 
from [S] and [2U| is whether the dependence of the approximation ratio on r and k is avoidable. 
As it turns out, our work here provides an affirmative answer for TD instances on spider graphs 
(i.e., trees in which at most one vertex has degree larger than 2). 

Theorem 1.4. TD is NP-hard on spiders. In these graphs, the problem also admits an 0(log(fc))- 
approximation. 

The theorem follows from the fact that TD in spiders and pDLS with unit processing times, 
and penalties are equivalent. We sketch the proof. Given an instance of TD on spider G = {V, E), 
we create a job for each vertex u G P, and let d^ = n — 9{v) -|- 1, and p^ = Wy = 1. We also 
create a dependence chain for each leg of the spider; i.e., the job for vertex v depends on all its 
descendants in the spider, rooted at its sole vertex of degree larger than 2. It is now an easy 
exercise to see that the TD instance has a seed set of size s iff the pDLS instance constructed has 
a schedule that makes s jobs late. 

2 Notation 

In the rest of the paper we will consider an instance of pDLS given by a collection [n] of jobs. 
Each job j has non-negative processing time pj, penalty Wj and deadline dj. The precedence 
constraints on [n] are induced by a collection of vertex-disjoint, directed paths V = {Pi, ..., Pq). 
In a feasible schedule job j has to precede job j' if there is a directed j,/-path in one of the paths 
in P; we will write j j' to indicate j has to precede j' from now on for ease of notation, and 
j ^ j' if we furthermore have j j' . We denote the set of distinct deadlines in our instance by 

V = {Di ,..., Dfi\, with higher indices corresponding to later deadlines, that is, indexed such that 
Di < Di! whenever i < i' . We use the notation i{j) G [k\ to denote the index that the deadline of 
job j has in the set T>, so we have that dj = for all j G [n]. We say that a job is postponed 
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or deferred past a certain deadline Di if the job is executed after Di. Our goal is to find a feasible 
schedule that minimizes the total penalty of late jobs. Given a directed path P, we let 

P^j := {f € [n] : j ^ f} 

be the suffix induced by job j G [n]. We call a sequence S = {Si, S 2 , ■ ■ ■, Sk) of suffixes of a given 
path P £ V a suffix chain if 

P D 5i D ^2 2 • • • 2 

while a suffix chain could have arbitrary length, we will only use suffix chains with length k = \P\. 
Given two suffix chains S and S' with k suffixes each, we say S P S' \i Si ^ for all i £ [k]. If 
we have neither S P S' nor S' ^ S, we say that S and S' cross. Given two suffix chains S and 
S', we obtain their join S V S' hy letting (S' V S')i = S, U S'. Similarly, we let the meet of S and 
S' be obtained by letting (S A S')i = Si Ci Si. 

3 An integer programming formulation 

Our general approach will be to formulate the problem as an integer program, to solve its re¬ 
laxation, and to randomly round the fractional solution into a feasible schedule of the desired 
quality. The IP will have a layered structure. For each deadline Di £ V, we want to decide which 
jobs in [n] are to be postponed past deadline Di. We start with the following two easy but crucial 
observations. 

Observation 3.1. Consider a path P £ V, and suppose that j £ P is one of the jobs on this 
path. If j is postponed past Di then so are all of j’s successors on P. Thus, we may assume 
w.l.o.g. that the collection of jobs of P that are executed after time Di forms a suffix of P. 

Observation 3.2. Consider a path P £ V, and suppose that j £ P is one of the jobs on this 
path. If j is postponed past Di, then it is also postponed past every earlier deadline Dii < Di. 
Thus, we may assume w.l.o.g. that the collections Si, ..., S^ of jobs of P that are executed after 
deadlines Di < ■ ■ ■ < Dk, respectively, exhibit a chain structure, i.e. Si D S 2 [2 • • • 2 S^. 

Combining the above two observations, we see that for each path P £V, the collections of jobs 
postponed past each deadline form a suffix chain = Sf’ fj S 2 3 • • • 2 S^. In the following 
we let denote the collection of suffix chains for path P; we introduce a binary variable xs for 
each suffix chain S £ and each P £ V. In an IP solution X 5 = 1 for some S G if for each 
i £ [k] the set of jobs executed past deadline Di is precisely S,. We now describe the constraints 
of the IP in detail. 

(ICip At most one suffix chain of postponed jobs per path. Since a job can either be 
deferred or not, and there is no meaningful way to defer a job twice, we only want to choose at 
most one suffix chain per path P £ P. Hence we obtain the constraint: 

^ X 5 < 1 VP G P. (Cl) 

sesp 

(IC2p Deferring sufficiently many jobs. In any feasible schedule, the total processing time of 
jobs scheduled before time Di must be at most Dp, conversely, the total processing time of jobs 
whose execution is deferred past time Di must be at least P — Di, where P = Pj total 

processing time of all jobs. This is captured by the following constraints: 

^ > r - Pi Vi G [k] 

PeVS&SP 

where p'g is the total processing time of the jobs contained in Si. While the above constraints are 
certainly valid, in order to reduce the integrality gap of the formulation and successfully apply 
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our rounding scheme we need to strengthen them, as we now describe. To this end, suppose that 
we are given a chain 


F^ = F[DFi’DFi‘D...DF[ 


of k suffixes of deferred jobs for each path P £ V, and let F = {F^}p^-p be the family of these 
suffix chains. Suppose that we knew that we were looking for a schedule in which the jobs in F^ 
are deferred past deadline Di for all P £V. For each f € [A:], a feasible schedule must now defer 
jobs outside IJpe-p Ff of total processing time at least 

0*’-^ := max | (F - A) - J]] Pj,o \ . (1) 

We obtain the following valid inequality for any feasible schedule: 

E E ^ ^ ^ (C 2 ) 

P&v S&SP 

where S is the collection of all families of suffix chains for V (including the empty family), and 
where is the minimum of 0 *T ^nd the total processing time of jobs j that are in Si but not 
in Ff ; formally, for F £ S, i £ [k], P £ P, and S £ , we set 


i,F 

Pg := mm 




E pp® 

j(^Si\Fr 

falls into the class of Knapsack Cover (KC) inequalities [U O HU |26], and the above 
capping of coefficients is typical for such inequalities. 

All that remains to define the IP is to give the objective function. Consider a job j on path 
P £ V, and suppose that the IP solution x picks suffix chain S £ S^. Job j is late (i.e., its 
execution ends after time dj = Di(j)) if j is contained in the suffix We can therefore express 
the penalty of suffix chain S succinctly as 

^5 := Wj. ( 2 ) 

j€P:j&Si(p 

We can now state the canonical LP relaxation of the IP as follows 


mm 


E E : (lClD,(lC 2 ]),x>oi. (P) 

P£P sesp ] 

For convenience we introduce auxiliary indicator variables Uj for each job j £ [n]. Uj takes 
value 1 if j’s execution ends after time dj, and hence 

•= E 

where P is the chain containing job j. 


4 Rounding the relaxation 

0ur rounding scheme does not apply only to (suitable) feasible points for (P), but in fact allows 
us to round a much broader class of (not necessarily feasible) fractional points {U, x) to integral 
feasible solutions {U,x) of the corresponding IP, while only losing a factor of 0(logA:) in the 
objective value. As we will see in Section . being able to round this broader class of points is 
crucial for our algorithm. In order to formally describe the class of points we can round, we need 
to introduce the concept of canonical chain families. Informally, the canonical suffix chain for a 
path P defers each job j £ P as much as possible, subject to ensuring no job in P is deferred past 
its deadline. The definition below makes this formal. 
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Definition 4.1. Given an instance of pDLS, we let Cf be the longest suffix of path P that 
consists only of jobs whose deadline is strietly greater than Di. Jobs in Cf may be seheduled to 
complete after Di without incurring a penalty. We call 


:= D 




the canonical suffix chain for path P, and let C = {C^}p^p be the canonical suffix chain family. 


Our general approach for rounding a solution ([/, x) to program ([P]) is to split jobs into those 
with large Uj values and those with small ones. While we can simply think of “rounding up” Uj 
values when they are already large, we need to utilize the constraints (ICll) and (IC2I) to see how to 
treat jobs with small Uj values. As it turns out, in order to successfully round {U, x) we need it to 
satisfy the KC-inequality for a single suffix chain family only. Naturally this family will depend 
on the set of jobs with large Uj value. We can formalize the above as follows. 

Consider any instance X of pDLS, and let (C/, x) be a solution to (P). Define the set L of jobs 
that are late to an extent of at least 1/(7 log A:) for a parameter 7 > 0 (whose value we will make 
precise at a later point): 

L = {j : Uj > 1/(7 log/c)}. 

We now obtain a modified instance of pDLS, denoted Ip, by increasing the deadline for the jobs 
in L to P. Thus, jobs in L can never be late in the modified instance Ip. Note that since we do 
not modify the processing time of any job j G [n], we have that and 0 *’^ remain identical in 
Ip and I for all i, F, and S. Similarly, each job j G [n] has the same penalty Wj in X and Xp. 
Let C be the canonical suffix chain family for Xp. We are able to round a solution {U,x) as long 
as it satisfies the following conditions: 

(a) for each P €V, the set {5 G : X 5 > 0} is cross-free 


(b) {U,x) is feasible for a relaxation (P’) of dPl) that replaces the constraints (IC2I) by 

E WG[fc], (C2’) 

PeVS&SP 

where C is the canonical suffix chain family for the modified pDLS instance Xp. 


In the next section, we see how we can find solutions satisfying both of these conditions. 

Suppose {U,x) is a solution to (jP]) that satisfies (a) and (b). Obtain by letting x^ = xs if 
S makes at least one job j G [re] late in Xp, and let = 0 otherwise. Define U^ < U as in m 
(with x*^ in place of x), and note that {U^,x^) satisfies (a) and (b). Let us now round {U^,x^). 
We focus on path P ^V, and define the support of (U^,x^) induced by P: 

:= {5 G 5^ : x^ > 0}. 

As this set is cross-free by assumption (a), T'^ has a well-defined maximal element S* with S P S* 
for all S G (recall, S P S* means S defers no less jobs past every deadline Di than S* does). 
By definition, S* makes at least one job j G [re] \ L late. Since S* is maximal in it therefore 
follows that j is late in all S G . Using the definition of {U^,x^) as well as the fact that j ^ L 
we obtain 


Y. 4= E = < 

SeTP SeSP-.jeSi^j-) 


1 

7 log A: 


( 4 ) 


We let {U,x) = 7 log A: • (U^,x^) and obtain the following lemma. 


Lemma 4.2. (U,x) satisfies 

^xs<l ViG[A:],VPGP (CT) 

S&SP 

X] p'fxs Pjlogk ■ ViG[A:], (C 2 ) 

P&VseSP 
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where C is the canonical suffix chain family defined for the modified instance Zl of pDLS. 


Proof. Observe that since x = 7 log k ■ x, we can view constraint (ICl|l as being precisely ine qual- 
ity (HI with both sides scaled up by a factor of 7 log k] similarly, we can also view constraint (|C 2 |) 
as constraint (jC2’P scaled up by this same factor. Thus, the lemma follows immediately from 
inequality dH and the fact that (U^,x^) is feasible for (P’). □ 


We now randomly round (U, x) to an integral solution {U, x) as follows. For each P G P, we 
independently select a single random suffix chain S € using marginals derived from x, and set 
the corresponding xs = 1- In particular, we set x so that for all P € P and all S € we have 

^xs ifSGT^ 

1 — Y2s'eTP if P = . 


Pr[x5 = 1] = 


Since {U,x) satisfies (IClP . we can see that the above describes a valid randomized process. We 
run this process independently for each path P G P to obtain x. A job j G [n]\L is late if it is 
contained in level i{j) of the suffix chain S chosen for path P by the above process. Thus, we set 

Uj ■■= 

SeSP-.jeS^j-) 

We now claim that the expected value of IJj is precisely tjj. 


Lemma 4.3. For all j ^ L, E[Uj] = Uj. 

Proof. Let P be the path containing j, and consider a chain S G such that j G PiQ)- The 
probability for xs to be 1 is precisely xs, and hence it immediately follows that 
E[Ufi= Yl Pr[xs = l]= xs = Uj. 

s&sP-.j&Sif^j) SesP-.jeSi^j) 

□ 


The preceding lemma shows that the expected penalty of (P, x) in the modified instance Zl 
is exactly X]je[n]\L following lemma shows that the schedule induced by x postpones 

at least 0®’^ jobs past deadline Di for all i G [k] with constant probability. 

Lemma 4.4. With constant probability, we have 

Y > 0 *’^ Vi G [k], 

P&v sesp 

where C is the canonical suffix chain family for the modified pDLS instance Zl 
7 = 4 the constraint holds with probability at least 0.7. 

Proof, (of Lemma \4.4\ ) Our proof relies on two bounds on random variables. Before proceeding 
with the proof itself, we begin by stating the two required bounds for the sake of completeness. 

First, we need the following form of Bernstein inequality m- Let Xi,..., Xn be independent, 
nonnegative random variables uniformly bounded by some M > 0, i.e. such that Pr[W < M] = 1 
for all i. Then, if we let X = ^• Xi, we have that 

Pr|X<E[Xl-Al<exp(-^^^-^Ph^), (0) 

for any A > 0. 

Second, we need the Bhatia-Davis Inequality [3], which states that for any random variable 
X with support in the interval [m, M], i.e. such that Pr[m < X < M] = 1, we have 

Var(A:) < (M - E[A:])(E[X] - m). (7) 


(5) 

In particular, for 

















We now describe how we apply the above bounds to achieve the desired bound on the proba¬ 
bility that ([5]) is satisfied. Fix some i G [fc]. Define random variables {Xp}pg-p as 

Xp:= Y, Pf^s 

SesP 

where C is the canonical chain family defined for the modified instance of pDLS; let X = 
'Thp^'P Xp denote the sum of these random variables. We make the following observations on 
the random variables Xp: 

• for P, P' £ V, P ^ P', we have that Xp and Xp/ are independent since our rounding process 
made independent choices for the two paths; 

• for each P £P we know Xp is nonnegative, since and xs are always nonnegative; 

• for each P £V we can see that we always have 

Xp= pf^s < ^fpPf < 

sesP _ 

where the first inequality follows by constraint (ICll) . and the second inequality follows by 
the definition of ; and 

• the expectation of X = YlpepXp satisfies 

E[X] = E 


[ E E 




^ ^ p"f > 7 log • 0* 

Pev sesp ^ Per sesp 
The inequality above follows by constraint (jC2ll in Lemma l4.2t the second equality above 
follows by observing that we always have Pr[x^ = 1] = in our sum. 


With the above observations in hand, we apply the Bhatia-Davis Inequality ([7]) to get that 
Var(X) = ^ Var(Xp) < ^(0*’^ - E[Xp])(E[Xp] - 0) 


Per 


PGP 

< 0*’^E[Xp] 


PGP 

= 0*’^E[X], 

where the first equality follows by the fact that the variables Xp are independent. Thus, applying 
the Bernstein inequality Q with M = 0*’*^ and A = E[X] — 0®’*^ gives us that 

(E[X]-0®’^)^ \ 


Pr[X < 0®’^] < exp ( - 


< exp — 


2 (0®-CE[X] + (E[X] - 0®-^)0hC^/3) 

(E[X]-0®’^)^ \ 

(4/3)(0®-CE[X]) ) 


= exp 


3 /E[X]\ 


Qi,C \ ■ 


4V0hC^jv E[X]J 


The first inequality above follows from the previously mentioned application of the Bernstein 
Inequality, the second by observing 0®’^ > 0 always and gathering like terms in the denominator, 
and the equality follows by pulling a factor of (E[X])^ out of the numerator. As noted earlier, 
however, we have that E[X] > 7 log A: • 0®’^; taking 7 = 4 and substituting this into the above 
gives us that, in fact, Pr[X < 0®’*^] < Since the constraint ([5]) holds for a given i £ [/c] if 
and only if X > 0®’*^, by taking a union bound we can see that the constraint holds for all i £ [k] 
with probability at least 0.7. □ 


9 









For each P ^ V lei be the join of the suffix chain corresponding to solution x, and the 
canonical suffix chain i.e., suppose that X 5 = 1 for 5 G . Then 

= sy C^. (8) 

Clearly, is a suffix chain for path P. We use the following greedy algorithm to obtain a 
schedule. 

for i = 1 to /c do 

for all P G T* do 

Schedule all jobs in P \ Sf not already scheduled respecting the precedence constraints 

end for 
end for 

Schedule all remaining jobs respecting the precedence constraints 

Theorem 4.5. The schedule produced by the above algorithm is feasible. Furthermore, if 
holds, the schedule has cost at most instance 

Proof We begin by noting that the schedule produced by the proposed algorithm respects the 
precedence constraints of all P G P. This follows as is a suffix chain for all P G P, and hence, 
the algorithm schedules the jobs in \ Sf in iteration i for P G P in precedence order. 

Next, we show that whenever ([5|) holds, we have that the penalty of the schedule produced 
by our algorithm is at most ™ instance Xl. Note that the total processing time 

of all jobs scheduled after iteration i by our algorithm is 

E ^ f Pi) 

P£Pj(zsP P&v \s(iSP j&CP J 

a + E E Pi 

PGP jGCf 

> F - A, 

where F is the total processing time of all jobs. The first inequality above follows by the definitions 
of and S; the second by our assumption that ([5|) holds; and the third by the definition of 0*’^. 
This means, however, that all of the jobs scheduled during iterations 1, 2,..., f of our algorithm 
will be completed by time Pj. Now consider a job j € P\L with deadline dj = Pi(j). If j is late 
in the given schedule, then it must have been scheduled after iteration i{j). This implies that 
j G and so 

*5 = 1 - 

SGSP-.jeSi^j) 

Thus, if we let C be the set of jobs j that are late in the schedule produced by our algorithm, we 
can see that the penalty of that schedule in the modified instance Xl is 

jec j^L 

exactly as claimed. □ 

Corollary 4.6. The schedule produced by the above algorithm is feasible and incurs penalty at 
most Slogk ■ WjUj in the original instance of the pDLS with constant probability. 

Proof. (Proof of Corollary ra Recall how we arrived at the integral solution {U,x). Given a 
solution {U,x) for ([P]), we define the set L of jobs j whose indicator variables Uj have value at 
least 1/(7 log A:). We then obtained a modified instance Xl of pDLS by increasing the deadlines 
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of jobs in L to F. Assuming that (?7, x) satisfies constraint (IC2I) for the canonical suffix chain 

family C for Xl we then generated a new solution (C7, x) such that 

^ < 7log A: WjUj. 

j^L j&L j 

We then rounded our solution (U, x) to produce the integral solution {U, x) to the modified 
instance. 

Now, by combining Theorem 14.51 and Lemma [4.41 we can see that with probability at least 0.7 
we get a feasible schedule whose cost is at most modified instance, while setting 

the parameter 7 = 4. Consider how the cost of this schedule can change between the modified 

instance and the original instance: since our modihcation was precisely to set the deadline of 
every job j £ L to T, we know that the cost of our schedule in the original setting can be at most 
Ei^L^jUj + EjeL'>^j- 

Now, by Lemma [T3l we have that rcjC/j] = Thus, Markov’s Inequality 

gives us that ^ with probability at least 1/2; taking a union bound over 

this and our probability of our rounding procedure producing a feasible schedule, we can conclude 
that with probability at least 0.2, we produce a feasible schedule the cost of which in the original 
instance is at most 

i^L j£L j 

exactly as claimed. □ 


5 Solving LP ( iPj) 


While we have shown how we can express the pDLS problem as an IP in Section [3l and how we 
can round solutions to a weakened LP relaxation in Section 01 an important step in the process 
remains unspecified: how do we find a solution (U, x) to round? This is especially problematic 
as the LP has an exponential (in n) number of both variables and constraints, and it is not clear 
how to solve such LPs in general. In this section, we show that ([P]) has a compact reformulation 
when precedences are chain-like. 


5.1 An IP formulation with polynomial number of variables 

Using the specific shape of precedences, we show how the important suffix-chain structure of 
postponed jobs can be captured more compactly. This allows us to reduce the number of variables 
drastically while slightly increasing the number of constraints. Roughly speaking, our new LP 
decouples decisions on job-postponement between the layers in [/c]. 

The new IP has a binary variable x*- for every job j G [n] and for all deadlines Di G V. In 
a solution x*- = 1 if job j and all of its successors are executed after deadline Di and all of job 
j’s predecessors are executed before deadline Di. We can see that our definition of the variables 
X*- ensures the desired suffix structure on every path P £ V (see Observation ETD; so we need 
only add constraints to ensure that the chosen suffixes for a path P form a chain (in the sense of 
Observation 13.2p . 

(|D1I) At most one suffix of postponed jobs per path per layer. We want to choose at most 
one suffix of jobs to defer for each path P £ P on every layer i £ [k]. This yields the following 
constraint corresponding to dClD: 

^x*<l yP£V,i£[k]. (DI) 

j£P 
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(ID2h Deferring sufficiently many jobs. As in our previous IP, our new formulation has a 
constraint for each suffix chain family F £ S and each layer i £ [k]. As before, suppose that we 
were looking for a schedule that defers the jobs in Ff’ past deadline Di for all P £V, and for all 
i £ [k]. Then define 0*’^ as in ([T|), and let 

p*’-^:=min| ^ 

for every job j £ [n]. The new constraint corresponding to (IC2I) is now: 

E E Vie [A:], VFg5, (D2) 

P€Vj(zp\FP 

which enforces that the total (capped) weight of jobs deferred past Di, beyond what is deferred 
by F is sufficiently large. 

(|D3p Chain structure of postponed suffixes. We need additional constraints to ensure our 
new IP chooses suffixes of jobs to defer that exhibit the chain structure that characterizes feasible 
schedules. Consider i,i' £ [k], with i < i', and let j £ [n] be a job whose execution is postponed 
until after time Di'. Then as previously observed, its execution must also be postponed until 
after deadline Di, by the ordering of deadlines. We capture this with the following family of 
constraints: 

E - E Y ^ G Vi € [A: - 1]. (D3) 

j'-j'hj j'-j'dij 



We can now state the entire IP. As with dED, we introduce a binary variable Uj that takes 
value 1 if job j £ [n] is postponed past its deadline dj. For a given job j £ [n], we can define 
this variable in terms of the variables in our new IP as follows. Let dj = Dij^jy Then job j is 
postponed if it is part of a chosen suffix for some layer i > i{j). We can therefore set 

Uj := max x], = xY j 
i>iU) ^ ^ J ’ 

where the equality follows from (jD3p . The standard LP relaxation of the IP is: 


mm 


WjUj : (IDip . (ID2p . (ID3p . x > 0 


(P2) 


J&m 


In the case of chain-like precedences, we are able to obtain an efficient, objective-value pre¬ 
serving, and invertible map between fractional solutions to (|P2p and fractional cross-free solutions 
to dEI). 


Theorem 5.1. Consider {xs}sesP P&V’ 37 > 0, and let 

= E 

sesP-.Si=p^j 

for all i £ [k], P £ V, and j £ P. Then: (i) x satisfies eondition ()D3jl : {ii) x satisfies condi¬ 
tions (inii) if and only if x satisfies eonditions (imi); {Hi) for any F £ S, x satisfies (|D2I) for F 
if and only if x satisfies (IC2I1 for family F. Furthermore, the objective value of x in © equals 
that of X in (jPH). 

Proof. We begin by showing that equation ([9]) implies that x satisfies condition (ID3jl . Fix P £ P, 
j £ P, and i £ [A: — 1], and observe that 

E E E ^5= E ^5- 

j'-flj j'-j'lj sesP-. sesP-.jeSi+i 

Si+l=Pyj/ 
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If we apply the same transformation to Ylj' j'^j condition (ID3p is equivalent to 

^ xs< xs- ( 10 ) 

SG5^’:jeSi+i s&SP-.j&Si 

Since every S G is a chain we have (Sj+i C Si, and thus every summand on the left side of 
the inequality also appears on the right. Since x >1^, it follows that both inequality ifTOjl and 
constraint ()D3p hold. 

To see that (ii) holds, we focus on path P ^V, and i € [A:], and observe that 

Y .^3 = Y 1 Y 1 ^s= Y 1 

j&p jGP sesP-. sesP 
Si=Pyj 

It immediately follows that x satisfies m iff X satisfies m- 

Next, we show that constraint (C2) holds for x if and only if constraint (D2) holds for x. 
Consider a chain family F on V, and a layer i G [A:]. For P G "P and j G P, we can see that for 
any S G such that Si = Pyj we have 


i,F 


p- = mm 


If we apply 


followed by the above equality, we get that 


iP 


E E 

P€Pj&P\FP 


i^F ~i 

V = 


E E r E 


P&Pj&P\FP 


Si=Pyj 
i,F 


E E E ps 

PeVjeP\FP sesP-. 

Si=Pyj 

J,F. 


xs 


= E p's ^s, 

P&P S&SP 

where the final equality follows by combining the inner two summations and observing that 
= 0 whenever Pf D Si. Thus we have that x satisfies the constraints in (ID2p corresponding 
to F if and only if x satisfies the constraints in (IC2p corresponding to F. 

Finally, fix some P G P. Using Q and the definition of Uj, we see that 

E = E E ^ = E E E = E E 

jGP jePj'<j j&Pj'd^j SeSP-. jePsesP-. 

where the final equality follows simply by combining the two inner summations. Note, however, 
that if we change the order of the two summations on the right-hand side we obtain 

E E ^3^3= ^sxs- 

jeP sesP jeP-. SgSP 

Summing the above over P G P, it follows that x and x have the same objective values in (IP2I1 
and m, respectively. □ 

The above theorem immediately implies a natural algorithm for efficiently constructing a 
solution X to (|P2D from a given solution x to (jP|). It fails to show how to perform the inverse 
operation, however. We now provide the missing piece . 

Theorem 5.2. Given a solution x to (IP^ . we can efficiently construct a cross-free solution x to 
(|PP that satisfies condition @ from Theorem \5.1[ 


Proof. We begin by constructing the collection of suffix chains that lie in the support of our 
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claimed solution x for (0; we then describe the values x associates with each of the suffix chains 
in this collection; and finally we argue that x satisfies the claimed properties. 

Consider a path P € P, i G [/c] and a € [0,1]. Let 



and note that the non-negativity of x implies that Si{a) is a suffix of P. Furthermore, constraint 
(|D3p implies that 

Si{a) D S2{a) D ••• D 5fc(a), 

and S{a) is therefore a valid suffix chain for P. Furthermore, for 0 < a < a' < 1, and i G [k] one 
easily sees that 5*(a) 2 Si{a'), and hence S{a) and S{a') do not cross. With this we now easily 
bound the number of distinct members of the family {S(a)}Q,. Consider increasing a continuously 
from 0 to 1, and count the number of times changes occur. For each i G [k], we know that Si{a) 
can only take on |P| -|- 1 different values. Since a < a' implies S(a) P S{a'), it follows that Si{a) 
becomes smaller as a increases. Thus, as we increase a from 0 to 1, Si{a) can only change at 
most |P| times. Since any time 5(a) changes, at least one 5j(a) must change, we may conclude 
that 5(a) takes on at most k\P\ + 1 distinct values. 

Now we show how to construct a solution x for ([P]) with support {5(a)} q,. For suffix chain 
5 G we let 

xs '■= supja G [0,1] : 5(a) = 5} — infja G [0,1] : 5(a) = 5}; (12) 

if 5 = 5(a) for some a G [0,1], and we let xs ■= 0 otherwise. 

Finally, we show that Q holds, i.e. that 

= Z] ^5- 

S&SP-.Si=P^j 

Fix i G [k] and j G P, and consider the sum on the right hand side of the above equality. Recall 
that we previously observed that a < a' implies 5(a) P 5(a'); combining this with equation (fT^ . 
we can easily see that the sum we care about will, in fact, be a telescoping sum that simplifies to 

xs = supja : 5j(a) = Psj} — infja : 5i(a) = Pyj}- 

sesP:Si=Pyj 

By our definition of 5*(a), however, we can see that j G 5*(a) if and only if — “• 

Further, 5i(a) can only include elements strictly preceding j in P if Thus, we 

may conclude that 

supja : 5j(a) = Psj} — infja : 5i(a) = Psj} = ^ ^ Pj/ = Pj, 

j'dij j'^j 

exactly as required. □ 

5.2 Solving the relaxation 

The usual way to solve linear programs with an exponential number of constraints is to employ 
the ellipsoid method P. The method famously allows us to reduce the problem of solving ()P2p 
to that of efficiently separating an infeasible point from ()P2p . For a given candidate solution 
{U,x) to (IP2p . it suffices to decide (in polynomial time) whether it is feasible, and if not, return 
a violated inequality. We do not know how to solve the separation problem for (IP2I) . and it is in 
fact not known how to separate KC inequalities efficiently in general. 

We will overcome this issue following the methodology proposed in [5] and relying on a relaxed 
separation oracle. For this, we consider a relaxation (P2’) of (|P2I) where we replace constraints 
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(ID^ by 

X] Z] Vie [fe], VFg5'. (D2’) 

p<^rjeP\FP 

for a subset S' C S that initially contains only the canonical suffix chain family C for instance X. 

In an iteration, we apply the ellipsoid method to (P2’), and this generates a point {U,x) 
that is optimal and feasible for (P2’), but not necessarily feasible for (|P2p . Using the procedure 
developed in Section U] we now attempt to map {U,x) to a cross-free solution to (jP|). Specifically, 
for parameter 7 chosen there, we let 

L = {j : Uj > 1/(7 log/c)}, 

be the collection of jobs whose indicator variable is large; note that U remains constant under 
the correspondence between solutions to (jP]) and ()P2I) as given in Theorem 15.11 and so this is 
precisely the set L used in Section [H As before, we imagine increasing the deadline for the jobs 
in L to r to produce a modified instance Ip of the pDLS. 

We now check whether {U, x) violates (ID2h for the canonical suffix chain family C of instance 
Xp. In this case, we add C to S' , and recurse. Otherwise, we know that we can apply the lifting 
operation of Theorem 15.21 to obtain a new candidate solution (17, x) for ([P]) whose support is 
cross-free. By Theorem 15.II we can see that, while {U,x) may not be feasible for ([P]), it is feasible 
for the relaxation (P’) used in our rounding procedure, and further has objective value no larger 
than the optimum of (|P|). Thus, applying the rounding procedure of Section U] yields a solution 
to our instance of pDLS with penalty 0(7 log A:) times the optimal with constant probability. 

In other words, the process described above consists of applying the ellipsoid method for 
solving ()P2p with a separation oracle that might fail in providing a violated inequality: we stop 
the algorithm at the first moment that our separation oracle fails, so as to guarantee that the 
number of iterations (and therefore the size of S') is anyway polynomially bounded in the number 
of variables. The solution output at the end might be infeasible for ()P2I) . but (as discussed above) 
provides a lower bound on its optimal value and is feasible for the relaxation (P2’); this ensures 
we can lift it into a solution for (jP|) that has cross-free support and is feasible for the relaxation 
(P’), which are precisely the properties required by our rounding procedure. 
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A Constant number of paths 


In this section, we present an algorithm for solving the pDLS problem when chain-like precedence 
constraints can be modeled by a constant number of paths, i.e, \V\ is a constant. For each path 
P € V, let be a suffix of path P, and let V be the vector formed by these suffixes. Further, 
for a given path P and any suffix 0, let min(y^) be the first job in suffix according to 

the precedence order ^ on jobs, i.e. the job we would have to schedule first among those in . 

Now, we are ready to design a dynamic program for solving our problem. We define OPT(F, t) 
to be the smallest overall postponement cost we can incur on jobs in V, when we schedule them 
beginning at time t while respecting precedence constraints. In order to compute the value of 
OPT(y, t), we need to consider how we can schedule the jobs in V. In particular, consider the 
first job we choose to schedule. While we can choose this first job from any path P for which 
0, the precedence constraints enforce that it must always be the earliest job in i.e. it 
must be min(y^). Define cost(j, t) as 


cost(j, t) 



if dj <t + pj; and 
otherwise, 


to capture the cost of scheduling job j at time t. Then, we can express OPT(D, t) recursively as 

OPT(y, t)= min {cost{j,t) + OPTCV \ j,t + pj)), (13) 

PeV-.VP^ii 

where j = min(y'^) and we use V \j to denote the vector V excluding job j, i.e. we have 

-nP ^ ifjGP;and 

Iotherwise. 

Thus, if we abuse notation slightly and let V denote the vector of suffix chains that includes every 
job on every path, and 0 denote the vector taking the empty suffix on every path, we can see 
that taking base cases of OPT(0, t) = 0 for all t and computing OPT(P, 0) yields precisely the 
quantity we want to compute. 

Running time: While it might appear that the above recursion describes a dynamic program that 
runs in pseudo-polynomial time, due to the second parameter, we note that the first parameter 
always fully determines the second. In particular, for any recursive call OPT(y, t) made while 
computing OPT('P,0), a simple induction shows that we always have that 


Pr 

PerjeP\vP 

Thus, the number of values we need to compute is bounding by the number of possible vectors of 
suffix chains, which is Opepd-f^l + 1) = Since our recurrence takes the best among at 

most \V\ possibilities, we can immediately conclude that we can compute every value of OPT(D, t) 
of interest in time 0(17^1 


B Single Deadline 

In this section, we present a pseudo-polytime algorithm for solving the pDLS problem in the case 
where all jobs face a single, common deadline. The algorithm we give addresses not just chain-like 
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precedence constraints, but the more general case of precedence constraints forming a tree where 
jobs can have multiple predecessors (but still no more than one successor). This matches the 
setting of Ibarra and Kim m, and highlights the critical role of multiple, distinct deadlines in 
the pDLS problem: Lenstra and Rinnoy Kan [22] proved that the pDLS problem is strongly NP- 
hard even when all jobs have unit processing time and deferral cost, and precedence constraints 
are chain-like; with only a single, common deadline, however, the existence of a psuedo-polytime 
algorithm implies that the problem becomes only weakly NP-Hard, even in the more general case 
where precedence constraints form trees. 

We solve our problem by designing an appropriate dynamic program. Before proceeding with 
the details, we begin by giving some intuition and making necessary definitions. Limiting all 
jobs to share a common deadline D forces the problem to closely resemble the knapsack problem, 
and in fact our dynamic program follows in precisely that vein. With only a single deadline, 
our problem becomes one of deciding which jobs we choose to include before that deadline, and 
which we defer until after the deadline; in other words, we have a certain amount of time before 
the deadline occurs, and we need to decide what jobs we want to use that time on to minimize 
the cost of deferred jobs (or, equivalently, maximize the cost of on-time jobs). In the setting we 
consider, each job may have multiple predecessors but only a single sucessor. This implies that, 
if we consider any tree present in the precedence constraints, if we schedule a job j from that 
tree we must also schedule every job in the subtree rooted at job j. Thus, our task is to find a 
subforest of the forest of precedence constraints. If we focus on one tree, we can consider whether 
or not to schedule the job at the root: if we choose to schedule the root job, then we must schedule 
the entire tree; if we choose not to schedule the root job, then we may decide independently how 
(and if) to schedule jobs from the subtrees rooted at each of its children. In effect, we choose to 
either schedule the entire tree, and then recurse on the remaining trees, or replace the tree with 
one tree for each of the root’s children, and then recurse on the modified forest. 

In order to make the above intuition concrete, we need the following definitions. Let the 
collection [n] of jobs be indexed according to a pre-order traversal of the precedence constraint 
forest, and for any job j, let T(j) denote the subtree rooted j, i.e. all of j’s predecessors. Define 
next(j) = min{j -t- l,j -t- 1,..., n -|- 1} \ T{j). Note that this implies that every successor of j 
will have a strictly earlier index f < j, and the predecessors of j will be precisely those jobs with 
indexes in the range {j -|- 1, j -|- 2,... ,next(j) — 1}/. Further, let 

W{j)= ^ Wjv; and 
i'er(j) 

pu)= pf 

i'eTL) 

denote the total penalty and total processing of all jobs in T{j). Recall that every job j shares a 
single common deadline dj = D. 

We now describe our dynamic program. Let OPT(j, f) represent the minimum penalty we 
can incur if we only have t processing time in which to schedule jobs {j, j -|- 1,... , }. In order 
to compute this value, we need to decide whether or not to schedule job j in the time we have 
left before the deadline. If we do schedule j, then we must schedule all of the jobs in T{j). This 
uses up processing time P{j) but incurs no penalty, and we may then make independent decisions 
about all remaining jobs. If we do not schedule job j, then we incur a penalty of Wj but use no 
processing time, and then may make independent decisions about all remaining jobs. Thus, we 
may express OPT(j, f) via the recurrence 

OPT{j,t) = min{OPT(next(j),t - P{j)),Wj + OPT(j l,t)} , 
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with base cases of 


OFTij,t) = 


+00 if t < 0 ; and 

^0 if f > 0 and j = n + 1 . 

From the above discussion, we can readily see that the value of OPT(l,F)) will be precisely 
the minimum penalty that can be occured when scheduling all of the jobs in [n]. 

Running time: Since we will only need to compute OPT(j, t) for parameter settings j £ [n + 1] 
and t E [D], we can see that our dynamic program will require at most 0{n ■ D) distinct values 
to be calculated. Since each calculation is simply a minimum of at most two options, we can 
compute the optimal value recl,D in time 0(n ■ D). 


C Correlated rounding 

In this section, we consider whether we can replace our independent rounding scheme with a 
dependent one to improve our approximation ratio. We follow an approach similar to that of Carr 
et al. [S] for the knapsack problem. At a high level, the key idea in the rounding scheme of Carr 
et al. is to try and ensure that the possible solutions it might produce are as uniform in size as 
possible. In particular, they do this by ensuring that sets of items in each potential knapsack 
solution have size profiles that are as similar as possible. In our setting, this would correspond 
to trying to ensure that for every deadline Dj, the family of suffixes we defer on the set V of 
paths has size that is as uniform as possible; unfortunately, we are much more constrained when 
selecting suffixes than we would be when choosing items for a knapsack. 

Specifically, there are two key difficulties. First, since each solution can only use a single suffix 
from each path, and different paths can randomize over paths with very different size profiles, it 
may be impossible to ensure that every solution has a similar size profile. For example, if every 
path but one defers a negligible number of jobs past some deadline Di, then we cannot make our 
solutions any more uniform in size than the distribution on that single critical path. Second, even 
if we can ensure that solutions defer sufficiently uniform total size of jobs past a given deadline 
Di, we need a way to do this for all deadlines in V simultaneously, that is, we need a way to ensure 
our approach makes consistent choices for every deadline Di. This is problematic: even though we 
work with suffix chains and so know that, on average, the length of the particular suffix deferred 
on a given path decreases with the deadline Di, the rate of this decrease could differ greatly 
between paths. For example, one path P might defer much larger suffixes (on average) past some 
deadline Di than every other path in P] but if the size of suffixes deferred on P past deadline 
Dj+i becomes much smaller, while that of suffixes deferred on other paths remains relatively 
constant, the situation becomes exactly the reverse for deadline Since the relative sizes of 

suffixes deferred on each path can change quite dramatically between one deadline and the next, 
it becomes difficult to devise a scheme that ensures consistent choices for all of the deadlines in 
V. 

It turns out that the first concern above is not difficult to deal with; it is the second concern 
that causes difficulties. When we only have two paths, however, we can overcome the second 
concern as well. In this section, we describe a correlated rounding scheme for two paths that 
provides a 2-approximation for the case of two paths. While we could achieve the same factor for 
two paths using a naive approach, we are hopeful the technique we describe here can be extended 
to more paths without the approximation factor increasing linearly with the number of paths (as 
the naive approach’s factor would). 

We now describe our correlated rounding scheme for the case where we only have two paths, 
say V = {Pi, ^ 2 }- We follow the general outline as the rounding procedure of Section 01 with two 
major changes. First, we adjust the definition of the set L, replacing the filtering parameter 7 log k 
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by 2. Second, we modify our rounding procedure to be correlated, rather than independent, for 
the two chains Pi and P 2 . We discuss the two changes in detail below. 

Our first change in the rounding procedure is to adjust the filtering parameter that splits jobs 
based on whether their Xj values are large or small. In particular, we now consider Xj to be large 
only if it is at least 1/2, rather than when it exceeds l/ylog A; as in Section 21 This means the set 
L of late jobs defining our modified instance Xl is now L = {j : Uj > 1/2}. 

Our second change will be to adjust the distribution used in our randomized rounding proce¬ 
dure. Before, we rounded the variables for each path P G P independently; now, however, while 
our rounding scheme will continue to induce the same marginal probability distribution on each 
path, we modify our rounding process so that the random choices we make on the two paths are 
strongly dependent on each other. Retaining the same marginal distributions ensures the analysis 
of Section 21 remains valid up until Lemma [4.41 with only minor changes to accommodate the new 
definition of late jobs L. The major changes in our approach occur from that point on: we can 
replace the concentration result underlying that lemma with an averaging argument that leverages 
the dependence structure we have introduced into our rounding scheme (see Lemma IC.41) . This 
new argument allows us to prove a much stronger approximation guarantee than we obtained for 
independent rounding, which furthermore holds with certainty. Before detailing how we obtain 
our new guarantee, we will briefly review the initial steps of our rounding procedure. Since these 
initial steps only require that our new rounding procedure induces the same marginals on each 
path, we defer further details of the rounding scheme for now. 

We begin by briefly recalling the overall structure of the rounding procedure from Section 21 
suitably modified for our new definition of the set L of late jobs (see that section for full details). 
We start with an instance I of the pDLS, and a solution (U, x) for our linear program (|P|). Then, 
we define a modified instance X^ of pDLS in which the set L = {j : Uj > 1/2} of late jobs all 
have their deadlines changed to be T (so they cannot be late in any schedule). Finally, we focus 
on a relaxation (P’) of (jP|) in which the set of knapsack constraints (IC2D is reduced to just the 
ones corresponding to the canonical suffix chain family for the modified instance Xi. 

We now describe the process we use to round our initial solution {U,x). This process can 
be applied as long as {U, x) both is feasible for the relaxed program (P’) and has support {S' € 
: X 5 > 0} on each path P € V that is cross-free; recall that in Section [5] we saw how to 
produce exactly such a solution. Given a solution (17, x) with these properties, we first modify it 
to produce a new solution (U, x) where each job j G L is no longer late, and each job j ^ L is 
late to twice the extent it is in (17, x). Formally, this means we set Uj = 2 - Uj \i j ^ L and Uj = 0 
otherwise. We correspondingly set 


xs = 



if S makes some job j ^ L late; and 
otherwise. 


Finally, we randomly round the fractional solution (17, x) to produce an integral solution (17, x), 
in such a way that Pr[x 5 =i] = xs for every path P £ V and every suffix chain S G S^. 

The same arguments as presented in Section 21 ~ with only minor adjustments - give us the 
following critical properties for (17, x) and (17, x). The two lemmas below are direct analogs of 
Lemmas 14.21 and 14.31 respectively. We state them without proof, as they follow from the same 
arguments as those given for their counterparts in Section 21 we refer the reader to that section 
for details. 
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Lemma C.l. {U,x) satisfies 

^xs<l yie[k],yP£V (CT) 

sesP 

E E > 2 • 0*’^ Vi G [k], (C2) 

P&T S&SP 

where C is the canonical chain family defined for the modified instance II of pDLS. 

Lemma C.2. For all j 0 L, E[Uj] = Uj. 


With the above two lemmas in hand, we are now ready to describe the rounding scheme we 
use to obtain ([/, x) in detail, and prove our approximation guarantee. 

We round our fractional solution iU, x) to the integral solution (17, x) as follows. For a given 
P (z V, for every a G [0,1] we define the chain 

S^{a) := minis' G supp(x) OS^ : 

where supp(a:) = Upg-pIS G : xs > 0}. The minimum in the above definition is with respect 
to the partial order ^ on suffix chains; recall that since we know supp(a;) is cross-free, this is 
well-defined. If the set in the above definition is empty, i.e. we have that YIs&sp then 

we define S^{a) = C^, where C is the canonical chain family for the modified instance Xp. We 
now round x to x as follows. Draw a single uniform random variable a ~ 7/[0,1], and index the 
pair of paths in our instance as P = {Pi,P 2 }. Our rounding procedure needs to choose one suffix 
chain for each of the paths; we use a to correlate our choices in the following manner. For paths 
Pi and P 2 we select the suffix chains and — a), respectively. At a high 

level, our goal is to use a to correlate our choices on the two paths, pairing large suffix chains 
on one path with small suffix chains on the other, and vice versa. By balancing our choices on 
the two paths in this way, we ensure that the combined weight of the suffix chains we choose is 
always relatively large. We make this idea concrete by first defining the rounded solution (f7, x) 
and then formalizing the above observation as a lemma. 

As stated above, we want our rounded solution to schedule jobs on paths Pi and P 2 according 
to the suffix chains = S^^{a) and 5^ = — a), respectively, where a ~ 7/[0,1]. Thus, we 

set X 51 = X 52 = 1, and set X 5 = 0 for all other S G U . Correspondingly, for each job f 
such that j L, we set Uj = 1 if j G or j G respectively, depending on whether j G Pi 

or j G P 2 ; for all other j G [n] we set Uj = 0. The following lemma shows that the rounding 
scheme outlined above produces the same marginal probabilities for {U,x) as those in Section U 
thereby establishing the validity of Lemmas 1C.II and 1C.21 


Lemma C.3. When {U,x) is rounded to {U,x) as described above, we have that for all P ^ V 
and all S G , 


Pr[x5 = 1] 


xs 

1 - ^S' 


where C is the canonical suffix chain family for Is- 


if C^; and 
ifS = CP, 


Proof. First, observe that if we set X 5 = 1, then we must have had that X 5 > 0 (or S = 
for some P G P where C is the canonical suffix chain family for Zl)] thus, for all other S we 
immediately have that Pr[x 5 = 1 ] = 0 = X 5 . 

We begin by focusing on suffix chains S in the support of x. Recall that the fractional solution 
(P, x) we began with had cross-free support, i.e. we had that {5 G : X 5 > 0} was cross-free 
for all P G P. Now, we defined x so that for all S we have xs > 0 implies xs > 0. Thus, we may 
conclude (P, x) has cross-free support as well. Fix some P G P, and enumerate in sorted order 
as 5^ ^ ^ where m = |supp(x)| < |>S^|. For any £ G [m], we can compute Pr[xg«] 
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as follows. Recall that we set £ 5 ^ = 1 if and only if we had that = S^^{a) or — a) 

(for P = Pi or P = P 2 , respectively). Now, from our definition of S^{-), we can see that these 

equalities hold, respectively, if and only if we have that 

£-1 e £-1 £ 


Xgi! <a< x^i! or Xgi! < 1 — a < Xg^t. 

£'=\ £'=1 £'=1 £'=l 

Thus, the probability of selecting is precisely the probability of choosing an a in one of the 
ranges above. Recalling that a ~ W[0,1], that xs is nonnegative for all S, and that YIs&sp ^5^1 
always (by Lemma IC.ip . we can conclude this probability is, in fact, the lengths of the intervals 
in question. Since both intervals have the same length, in either case we get that 

£ £-\ 


Pr[x<j« = 1] = ^ Xg,: - ^ Xgt, = xgi, 

£'=1 £'=1 

exactly as desired. 

Finally, for each P G R, we consider Pi[xqp = 1], where C is the canonical suffix chain family 
for Xl. Again, by the definition of our rounding process we can see this happens precisely when 
CPi = 5 -P 1 

(a) or CP2 = 5P2(1 

— a). This two equalities holds, respectively, precisely when 

m m 


xs < Oi or X 5 < 1 — a. 

SeS^i 5e5^2 

As in the previous case, however, the nonnegativity of x and Lemma 1C.II allow us to conclude 
that 

m m 


0 < ^ xs, ^ xs <1. 

Sg<S^i 


Thus, since a ~ W[0,1], we can conclude that in either case we 

Pr[xcp = 1] = 1 - xs 

SesP 


exactly as claimed. 


have that 


□ 


i,c 


Lemma C.4. With probability 1, we have that 

PeP s^sP 

for all i G [k], where C is the canonical suffix chain family for Xl. 


(14) 


Proof. We obtain the desired bound by combining upper and lower bounds for the expected value 
of the sum on the left of inequality (fTT)) : E[^pgp YIsgSP ^s]- Fix some i G [A:], and let M be 
the quantity we want to lower bound, i.e. the minimum value that the sum inside the expectation 
achieves. First, we show that the maximum value this sum ever achieves is at most M + 
this immediate implies that the expected value of the sum is also at most M + 0*’*^. We then 
show a lower bound of 20*’^ on the expected value of the sum. Chaining these two inequalities 
together immediately gives us that M > 0*’^, exactly as desired. 

We begin by proving our claimed upper bound on the sum YlpepYlsesP P^s^ First, we 
simplify this sum by recalling details of our rounding procedure. For each path P £ V we set 
exactly one xs equal to 1 , and set all others equal to 0 , based on a uniform random variable 
a ~ AY[0,1]. Thus, our sum reduces to precisely the coefficients of the two variables in question; 
by the definition of our rounding procedure, we can see that we get that 


E \ '' 2,0 r\i.C I 2 ,C* 

2 ^ Ps xs>Q’ =Psi +Ps 2 : 


PeP sesp 

where 5^ = S^^{a) and — a). 
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The key to our upper bound is showing that for any two values of a, the resulting values of 
the sum + differ by at most 0*’*". To that end, fix 0 < a < a' < 1; set = S^^(a) 

and = S^^(l — a) (as before); and set S'^ = S^^(a') and S'^ = — a'). Now, by our 

definition of S^{-), we can immediately see that a < ol implies that 5^ ^ S'^ and 5^ ^ 5'^, and 
so S\ D 5'^ and Sf C S'^. From the definition of : however, we can then see that we have 
0 < Pgn < < 0*’^ and 0 < < 0*’*". 

The above immediately imply the slightly weaker pair of inequalities 


ixC ^ isC _' i%C I /^z.C 

Ps'i ^ Ps^ - Ps'^ + 0 ’ 


and 


< v"£ < v£ + 0*’^, 


which combine to imply that 

(Pgf +P 5 ?) < +P 5 S) + 


P g1 — PgH — Pg2 


and ) + 0*’*^. 

Recall that we chose M = T^iria'^p^-pYlsGSP P^s^= minQ,(p^f +p^?). Thus, if we let a* 
achieve this minimum value M, i.e. pick a* E argmin„(p^f + p^^), we can see that applying the 
two inequalities above in the regions [0 ,a*] and [a*, 1], respectively, immediately gives us that 

E[E E p'f^s] = Ws? + P*s?] < E[M + 0*’^] = M + 0'’^, 

P&VSgSP 

exactly as claimed. 

Our lower bound follows simply by combining Lemmas 1C.21 and 1C.11 In particular, we get 
that 

E pf^^^ = E E Epr[x5 = 1] = E E pf^s > 20 *’^, 

PGP sgSP pgP sesp PGP sesp 


where the first equality follows since the xg are all binary random variables, the second follows 
by Lemma IC.21 and the inequality follows by Lemma IC.ll Combining this with our previously 
found upper bound on this expected value, however, we can conclude that M + 0®’*^ > 20®’^ or 
equivalently M > 0®’*^, exactly as desired. □ 


Finally, we observe that we can use the same procedure as outlined in Section [4] to produce 
a feasible schedule for X from the integral solution {U,x). As before, on each path P E "P we 
defer all jobs that are either in the suffix chain S E such that xg = 1 or in the canonical 
suffix for P in Xg, i.e. all jobs in the join S A C^, and then running the algorithm outlined in 
that section to get a feasible schedule. We get the following results corresponding to Theorem 14.51 
and Corollary 14.61 since their proofs follow from largely the same arguments as those given in 
Section 01 we only sketch the differences below. 

Theorem C.5. Applying the process and algorithm from Section^ to the solution (JJ,x) yields 
a feasible schedule with cost at most Ylj^g '^fij instance Xg. 

Proof. The proof of this theorem follows from the exact same argument as that for Theorem 14.51 
The only difference is the equivalent of constraint ([5]) for our current rounding scheme holds with 
certainty, due to Lemma fC.41 Specifically, we have that 

E E Pf^s > 0*’^, 

P&VsesP 

holds with probability 1, and hence our upper bound on costs holds unconditionally, unlike its 
counterpart from Section 01 In all other aspects the theorem and its proof are identical to those 
found in Section 01 and we refer the reader to that section for further details. □ 


Corollary C.6. The schedule produced by following the process and algorithm from Section^^is 
feasible in the original instance X of pDLS, and incurs expected penalty of 2Y^j wjUj. 
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Proof. The proof largely follows the same arguments as that of Corollary 14.61 and we refer the 
reader to that proof for full details. We sketch the main technical details below. Similarly to that 
section, since we defined L = {j : Uj > 1/2}, and set Uj = 2Uj for all j ^ L, we get that 

j^L j£L j 

Now, from Theorem 1C.5l we have the schedule we produce in the end has cost at most Ylj^L 
in the modified setting Recall, however, that the only difference between X an X^ is that in 
the latter we increased the deadlines of all jobs in L to be T. Thus, the cost of our schedule in 
X can increase versus the cost in X^ by at most the total penalty of all jobs in L, i.e. by at most 
Wj. Applying Lemma 1C.21 we can thus see that the expected cost of our schedule in Xl is 
at most 

E[^ Wj Uj + Wj] = Y Wj Uj + Y^^-‘^Y^ ’ 

j^L iGL j^L j 

exactly as claimed. □ 


D LP gap example 


In this section, we show that using the KC-inequalities (jC2p in the LP relaxation ([P]) is critical to 
our approximation factor. In particular, we show that without utilizing KC-inequalities, the inte¬ 
grality gap of (P) would be n(n/ log n). We do so by constructing a simple instance X of the pDLS, 
and considering the program (P) without the KC-inequality strengthening of constraint (IC2I1 . We 
will show that the LP relaxation of this weaker version of (P) admits a fractional solution whose 
value is a factor of n(n/log/c) better than any integral solution. Of special note is the simplicity 
of the instance X we construct: the instance consists of a single path, all of the jobs on which 
have unit process time and unit weight. 

In the rest of the section, we describe the instance X of pDLS and the fractional solution 
([/, x) yielding our claimed integrality gap. As mentioned, our example consists of a single path 
P = 1^2^---^n, with Wj = pj = 1 for all j € [n]. Each job has one oi k = nj^ distinct 
deadlines, with the deadline of job j being given by 

j if j is odd; and 
j — 1 if j is even. 

Thus, we can see that deadline Di = 2i — 1 for all i G [A:], and that i{j) = 

Since our example consists of a single path, there is only one feasible solution: simply schedule 
the jobs in the order they appear on that path. We chose our deadlines such that this schedule 
runs jobs with odd indices on time, and make jobs with even indices late. This results in a total 
cost of n/2. 

On the other hand, consider the following fractional solution to program (P) for the given 
example0 The support of our solution will be the set {S^}i of suffix chains indexed by £ G [A: -|-1], 
where we define by 

_ f {1, 2 ,... , re} if z < £; and 

* I {2z-|-1, 2z-|-2 ,..., re} if z > £. 


dj — 


^in the interests of simplicity, for the rest of this discussion we omit any uses of the index P and use < in place 
of ^ since, by construction, they are equivalent. 
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Our proposed solution then sets 

2 


Xs = \ 


2£ 

J_ 

2k 

0 


2 (£+l) 


if 5 = 5°; 

if 5 = for 1 < £ < A:; 
if 5 = 5^ and 


otherwise. 

Note that this solution has far lower cost than the integral one. In particular, for any j G [n], the 
suffix chain makes job j late if and only if j € ^iQ)- Whenever i{j) <£, we have j € Sj = [n]; 

on the other hand, for i{j) > i we can see that the smallest element of is + 1 > j + 1 , 


since i{j) = So j € if and only if i{j) < 1. Thus, we have that 


fc-i 


^3 = 


S&S:Siy)Bj (=iU) 


e=i{j) 


1 


2£ 2{i + l) 


1 1 
+ ^- 7 - 


The hrst inequality above follows by the definition of xs', the second by our dehnition of the 
support of X and our observations above on the third by our choice of x; and the last by 

observing that we have a telescoping sum, and recalling that we have 2i{j) = > j. Thus, 

we can see that the solution {U,x) produces objective value 

WjUj <Y~ = O(logn). 

jG[n] jG[n] 

Thus, any program with integrality gap better than n(n/logn) 

this candidate solution {U, x) violates. We next show that {U, x) 

program (P), as well as the version of constraint (C2) which does not use the KC-inequalities 

strengthening. This implies that the KC-inequalities are critical to achieving a small LP gap. 

We begin by showing that (U,x) satisfies constraint (Cl) from the program (P). First, note 

that we have defined x so that 

t=k , £=fc—1 


must have constraints which 
satishes constraint (Cl) from 


xs^-i^+ Y (vp 


p=i 


1 


1 


2 £ 2 {£ + 1 ) 




ses e=o 

and so constraint (Cl) is satished. 

Since our fractional solution {U,x) satisfies (Cl), we conclude constraint (C 2 ) is crucial to 
bounding the integrality gap of (P). Furthermore, we will now show that the “capping” of pro¬ 
cessing times by 0*’-^ is critical to achieving a good integrality gap for (P). We do so by 
demonstrating that without the capping operation, {U,x) would, in fact, satisfy constraint (C2), 
and hence show the integrality gap of (P) is n(n/logn). 

Consider constraint (C2) without the capping operation. Fix some suffix chain F on V. Using 
the previously given definition, for each i € [k] we have that 

0 *T — niax{n — Di — \Fi \ , 0 }. 

Furthermore, if we no longer enforce that the processing times we associate with suffixes be at 
most 0 *’^ we get that 

E PJ = \S^\F^\. 

j&Si\Fi 


i,F 

Ps = 


Fix i £ [k]. For constraint (C2) to hold in our setting, we need that 

Ps xs>&’ • 

SeS:Si2Fi 

We now show that the above always holds; we break our proof into three cases. 


Case: |F)| > n — Di. Then we have that 0*’^ = 0, and the inequality holds trivially. 
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Case: \Fi\ = n — Di — 1. Then we have that 0*’^ = 1; furthermore, since Di = 2i — 1, we 
can see that \Fi\ = n — 2i, and so Fi = {2i + 1, 2i + 2,..., n}. Recalling the definition of Sf, 
however, we can see that S'f D Fi only if i < £, and so = [n]. Thus, we get that 


E 


''fc-i 


i,F 


Ps = '^P^’fxse = IN \ Fi\ 

SGS'.Si^Fi i=i \i=i 


1 


2(^ + 1) J 2k 


+ 




Thus the desired inequality holds in this case as well. 

Case: \Fi\ < n — Di — 2. Then we have that for all 0 < £ < A:. So can see that 

i—1 k 

Y Ps^^S = “ Wi\)xsl 


SeS:Si^Fi 


e=o i=i 

k i—1 

= {n- Nil) Y 

1=0 £=0 

1 


= (n - INI) -2i{ 1 - 


2i 


= {n- INI) - ( 2 i - 1 ) 
= n - Di - \Fi\ 


= 0 


iF 


exactly as required. 


In every case, we get that constraint (C2) holds. Since our choice of F was arbitrary, we may 
conclude that requiring < 0*’^is critical to ensuring a good integrality gap for (P). 


E Tightness of rounding scheme 


Here, we give an example to show the limits of our current techniques when implemented with 
independent rounding. We construct an instance X and a fractional solution (?7, x) for X such that 
if we use the rounding procedure outlined in Section 4, while setting 7 = 0(l/log^fc) for some 
e > 0, then our probability of success will be o(l) in k. In other words, if we want our rounding 
procedure to succeed with constant probability, we simply cannot replace our boosting factor of 
ylogA: with one that is 0(log^“^ k). This shows that the result of Lemma 14.41 is tight, and so the 
approximation factor in Corollary 14.61 cannot be improved without significant new techniques. 

We begin by constructing the instance X of pDLS. Fix the number of deadlines A, and let n 
be the number of jobs for some n divisible by Each of our jobs will have unit runtime and 
unit weight, i.e. Wj = pj = 1 for all j € [n]. 0 ur set of deadlines will he V = {Di ,..., where 


Di = 




for all i £ [k]. 


Our set of paths V contains n/k identical paths of length A; each path contains a single job with 
deadline Di for each i £ [A], in increasing order along the path. Specifically, if one of our paths 
is P = {ji, j 2 , • • •, Jfc}, with ji F ■ ■ ■ < jk, then for all I £ [A] we have that i{ji) = i. In the 
following, we always index the jobs in a path P in this fashion for convenience, so that for any 
such path we always have that ji -< ji/ if and only if i < i', and that the deadline of job ji is Di 
for all i £ [A]. 

We now describe a fractional solution (U, x) for X, parameterized by 7 . For every path P £V, 
we will have only two suffix chains of P in the support of x. 0ne will be the canonical suffix 
chain for P, and the other will be a slight modification of the canonical suffix chain, which 
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we denote . Before defining these two suffix chains formaily, we first note that we wifi piace 
the majority of our solution’s weight on , setting 

' 1 - if 5 = C^- 


xs = S 


27 log k 


if 5 = C^; and 
otherwise. 


27 log k 

0 

Since by definition the canonical suffix chain family never makes any job late, this solution ensures 
that Uj < 1/27 log A; for all j G [n]. Thus, we can immediately conclude that for the solution 
{U,x), we have 

L = {j e [n] : Uj > 1/7 log A:} = 0, 


and so II = I- Thus, our rounding procedure works solely with the original instance Z, and the 
canonical suffix chain family C for X. ^ 

We now formally define the canonical suffix chain family C, the modification C, and the 
KC-Inequalities for Z corresponding to C. First, by inspection we can see that for any path 
P = {ji, J2, • • •, jfe} € V, we have that 

= {J2, ■■■ ,jk}Z {J3, {jk} 2 0, 

i.e. we have that Cf = {7+i,7+2, • • •, Jfc} for all i G [A;]. Now, we define the modification C 
as follows. First, we partition the set V of paths into k groups Hi, 112 ,..11^, each containing 
exactly n/k‘^ paths. Then, for any path P G P, we define the suffix chain as 

^ f if P ^ Hi; and 

* ifPGB,, 


where we take Cq to indicate the entire chain P. From the definitions of C, C, and x given above, 
we can compute that for each path P = {ji, ■ ■ ■ ,jk} ^P-, and each i G [A;], we have that 


Un 


27 log k 

0 


if P G Ilj; and 
otherwise. 


Thus, as previously mentioned, we can see that no job is made late to an extent of l/ylogA: or 
more, and so Xp = Z. 


Lemma E.l. For the described instance Z of pDLS, the constructed solution {U,x) satisfies 
conditions (a) and (b) of the rounding procedure of Section\^ i.e. the solution has cross-free 
support and satisfies the reduced constraint {C2’), whenever we have n/k"^ > 2'^\ogk, where 7 is 
the parameter for the rounding process. 

Proof. We begin by showing that condition (a) is satisfied, i.e. the constructed x has cross-free 
support. Fix some P G P. Now, we defined x such that X5 > 0 if and only if S G {C^ 
for all S G . Recall, however, that for all i we either have that C(‘ = Cf, or have that 
Cf = Cf_i D Cf. Thus, we may conclude that P for all P G P, and hence the support 
of X is cross-free. 

Now, we show that condition (b) is satisfied, i.e. the KC-Inequalities corresponding to the 
canonical suffix chain family C for the modified pDLS instance Xp are satisfied. Recall, however, 
that we already saw that Uj < l/ylogA; for all j G [n], and so Xp = X; thus, we are actually 
interested in the KC-Inequalities corresponding to the canonical suffix chain family for the original 
pDLS instance X. 

We begin by calculating the relevant constants for the KC-Inequalities. First, we compute 
0®’*^. Recall that all of our jobs had unit processing times, and so we have pj = 1 for all j G [n] 
and F = n. We claim that this gives us that 0®P = 1 for all i G [A:]. Fix some i G [A:]. Now, 
note that all of our paths are identical, and for each path P = {7, • • •, jfc} G P we have that 
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Cf\ = |{ii+i,.. ■ ,jk}\ = k — i. Thus, we can see that 

• n + 1 

Now, we consider the values of and P~p- First, we note that since denotes the (pos¬ 
sibly capped) number of jobs suffix chain S defers in addition to those deferred by the canonical 
suffix chain family C, we immediately can see that p^p = 0 always. Furthermore, since for each 

path P & V, C differs from C only in that it defers a single additional job past deadline where 
P € n^, we conclude that 



0*.c ^ (r - A) -\P\{k-i)=(n- 


lO 


if P G n,; and 
otherwise. 


Combining the above, we can see that the KC-Inequality corresponding to the canonical suffix 
chain family C holds whenever njl? > 27 log A:, exactly as claimed. To see this, we first compute 
that, for any i G \k] we have 
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where the first inequality follows by recalling that xs > 0 only when S G {C^, C^}, and p^p = 0; 


the second follows since 


P. 


i,C 

CP 


1 if and only if P G Ilj and is 0 otherwise; and the last since 


jlljl = njl?. Thus, since we already saw that 0®’*^ = 1, we can substitute in our computed 
values and rearrange terms to get that the (P, x) satisfies the KC-Inequalities corresponding to 
the canonical suffix chain family C if and only if re//c^ > 27 log k. □ 


Lemma E.2. Let ^ be a function of k, such that 7 = 0{l/log^ k) for some e > 0. For the 
instance Z of pDLS described above, with n = [27 log/c], applying the rounding procedure of 

Section^ to {U,x) using jlDgk as the boosting parameter has success probability that is o(l) in 

k. 


Proof. We begin by noting that the parameter settings outlined above are consistent with our 
example so far; in particular, we chose n to be divisible by kf, and furthermore such that n/k'^ = 
[27 log/c] and so the condition for Lemma lE . 1 1 holds. 

In order to calculate the probability of the rounding process succeeding, we begin by describing 
the conditions for it to succeed. Our claim essentially states that Lemma 14.41 from Section 0] is 
tight, and we build on the analysis used to prove that lemma. Considering that lemma, we see 
that for each i £ [k], one random variable Xp is defined for each path P G P as 

Xp:= p"fxs; 

SgSP 

xs is a random variable obtained by first modifying the solution (P, x) to produce a new solution 
(P,x), and then using the value of x to define the marginal distribution for x. While we refer 
the reader to Section 0] for the full details, we briefly describe the results of this process for our 
specific solution (P, x) in the instance Z. First, we note that from the definition of (P, x) in this 
section and the method for producing x, we will have that xgp = (ylogfc) • (l/27logfe) = 1/2 

and xg = 0 for all other S G \ {C^}. Second, we note that = 1 if P G Ilj and equals 

o 

0 otherwise for all P G P (see the proof of Lemma IE.II for details). Thus, if we consider the 
rounding process used to produce x, we will see that for each P G Ilj, Xp will be a binary random 
variable which takes values 0 and 1 with equal probability; and for each P £ V \ Xp = Q 
always. Furthermore, the random variables for each P G P are independent. 

Now, the rounding procedure succeeds for deadline Pj if and only if the sum X = Ylpep^P 
of the above variables is at least 0®’*^. Now, for the setting Z we have that 0®’^ = 1 (see the proof 
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of Lemma lE.il for details). As we saw above, however, Ap is identically 0 whenever P € L’ \ Ilj, 
so we conclude that the rounding process succeeds for deadline Di if and only if 

A = ^ Ap > 0*’^ = 1. 

PeHi 

Recalling that each of the Ap above is independently 1 with probability 1/2 and 0 otherwise, we 
can see that ^ 

Pr[A>l] = l-Pr[A = 0] = l- Pr[Ap = 0] = 1 - ' = 1 - 

and so the probability our rounding scheme succeeds for deadline Di is precisely 1 — 2“”/^ . 

Now, given the above, we want to compute the overall probability that our rounding scheme 
succeeds. In fact, this probability is precisely the probability that our rounding scheme succeeds 
for all of the deadlines in T>. Above, we saw that the probability our scheme succeeded for any 
single deadline was 1 — 2“"/^ ; we must be careful, however, because while our rounding scheme 
is independent for each path P £ V, there is dependence in its behavior for a given path P £ V 
with respect to the different deadlines Dp The key observation, however, is that the probability 
we succeed for deadline Di only depends on the random variables associated with paths P £ Hi. 
Since Hi ..., 11^ partition the set V of paths and the variables associated with each path P £V are 
independent for every deadline, we have ensured by construction that the events that we succeed 
with respect to deadlines Di and Dp will be independent whenever i ^ i'. Thus, we conclude 
that the probability our rounding procedure succeeds for ([/, x) in the instance Z is precisely 



Finally, we show that the probability we computed above is o(l) when we have that 7 satisfies 
7 = n(l/log^(A:)), some e > 0 , and n = [27 log/c]. First, we note that since 7 = 0 {l/^\og^ k), 

we must have that as k goes to infinity, either 7 log k either converges to some hxed constant 
c > 0, or goes to inhnity as well. 

First, if 7 log A: —c, some c > 0 , as A —>■ 00, we immediately get that the success probability 
above is o(l). This is because we can see that for all sufficiently large A, we have that n/A^ = 
[27 log A] < 2 c + 1 . Thus, we can bound our probability of success as 
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since c > 0 is a constant independent of A, we may conclude that our success probability goes to 
0 as A goes to infinity. 

Second, we show that if 7 log A —>■ 00 as A —>■ 00, we again get that our success probability is 
0(1). First, we note that this implies we must have that n/A^ —)• 00 as A —)• 00; thus, we may 
conclude that for all sufficiently large A, we have that 
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k 


( 15 ) 


Thus, if we want to show that our success probability is o(l), we need only show that the fraction 
A/ 2 "'/*^ ^ 00 as A 00. Note, however, that since 7 = 0 ( 1 / log‘" k), we know (again, for 

sufficiently large A) that n/k‘^ = [27 log A] < lg^“^/^ A, where IgA is the logarithm base 2 of A. 
Thus, we can conclude that when A is sufficiently large we have that 


A 


> 




= A 


1- 


lg=/2 k 


( 16 ) 


Now, since 


00 as 


—>■ 0 as A —>■ 00, we may conclude from equation (fTUI) that 
A —>■ 00. Combining this with equation (fT 5 ]) . however, we can see that our success probability 
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must converge to 0 as /c goes to infinity. 

Thus, as we have shown above, in either case we get that our success probability goes to 
0 as /c goes to infinity, i.e. our rounding process succeeds with probability o(l) in k exactly as 
claimed. □ 
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